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NEW REPOR TER GENES 

This invention relates to gene expression in 
eukaryotic organisms, and more particularly to novel 
reporter genes for use in such systems and to their use. 
5 In eukaryotic organisms, gene expression is mainly 

controlled by the transcription mechanism. RNA polymerase 
II catalyzes the synthesis of mRNA from the structural DNA 
genes through the formation of a complex with specific 
target DNA sequences (promoters) and protein factors which 
10 are responsible for the correct initiation of the 

transcription. Promoters for RNA polymerase II contain a 
variety of short cis-acting DNA sequences which are 
specifically recognised by transacting protein factors. 
These modular DNA sequences are located mainly upstream of 
15 the TATA box (when present) but can be placed in both 
orientations and at very variable distances from the 
transcription start site. The proximal region to this site 
is responsible for correct initiation of transcription, 
while the regions further upstream determine the 
20 transcriptional efficiency of the overall promoter. Some 
of these promoter sequences are contained in several genes 
and are recognised by widely present protein factors. 
Other promoter sequences, on the other hand, are 
characteristic of specific groups of genes and interact 
25 with specific transcription factors: they are called 
"responsive elements" (RE) . 
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The ability to determine the transcripti nal 
activity of cloned genomic sequences after their 
introduction into appropriate cells has substantially 
enriched understanding of how mammalian gene transcription 
5 is regulated* Such studies have been greatly simplified by 
the use of vectors containing reporter genes. 

Reporter genes code for proteins which possess a 
unique enzymatic activity or are otherwise easily 
distinguishable from the mixture of intra- and extra- 

10 cellular proteins produced by the expression system under 
study* The basic strategy for analysing the efficiency of 
gene transcription using reporter genes is as follows. To 
test the efficacy of the promoter elements of the 
expression system, the DNA under test is ligated upstream 

15 of the coding region of the reporter gene so as to produce 
a chimeric gene in which the putative regulatory elements 
control the expression of the reporter gene. Similarly, to 
test for functional enhancer elements, which by definition 
can increase transcription when placed upstream or 

20 downstream and in either orientation relative to the 
promoter, the test DNA can be ligated upstream or 
downstream of a reporter gene which already is associated 
with a promoter. The fused genes so obtained are 
subsequently introduced, by any of a variety of techniques, 

25 into cultured cells or into animals, e.g. to produce 

transgenic mice. The transcriptional capability of the DNA 
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under t st is then estimated quantitatively from the in 
XitES activity of the reporter gene product in the culture 
medium or in cellular or tissue-derived extracts, or is 
determined qualitatively or by histochemical techniques of 
5 intact cells. 

The basic technique may be varied to take account 
of the particular system being studied. Although the 
activity or quantity of the reporter gene product is an 
indirect measurement of the transcriptional properties of 
10 the DNA under test, generally the reporter gene activity is 
directly proportional to the transcriptional activity. 

The vectors containing reporter genes used up till 
now have certain properties in common. They often have a 
backbone derived from pBR322 or puc, which allow for 
15 propagation and selection in Escherichia r,m< In 
addition, such vectors generally possess a eucaryotic 
poly (A) addition signal and an intron, either intrinsic to 
the reporter gene or derived from a heterologous source. 
The presence of an intron is necessary for efficient 
20 production of some mature cytoplasmic mRNAs. Unique 
cloning sites, located upstream or downstream of the 
reporter gene, are designed for easy insertion of promoter 
and RE sequences to be tested. 

Up till now three main types of reporter genes have 
25 generally been used: 

(a) Chloramphenicol >npt Y if. ^ rn - r 0ne widely used 
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reporter gene codes f r chl ramph nicol ac tyltransferas 
(CAT) in E. coli (l). The CAT gene is not endogenous to 
mammalian cells and the enzyme which it produces can be 
easily assayed and its activity detected at low levels. 
5 CAT catalyses the transfer of acetyl groups from acetyl 
coenzyme A (acetyl-CoA) to chloramphenicol. Mono- and 
diacetyl forms of chloramphenicol can be produced. Since 
acetylated and unacetylated forms of chloramphenicol 
possess different solubilities in organic solvents, they 
10 can be separated by silica gel thin-layer chromatography 
(TLC) or by differential extraction in organic solvents. 
Moreover , the acetylated forms can be radiolabelled by 
incubating ( 14 C) chloramphenicol with acetyl-CoA, or, less 
expensively, incubating ordinary chloramphenicol with 
15 ( 3 H)acetyl-CoA in the test reaction. The amount of 

acetylation can be measured by densitometry or by liquid 
scintillation counting of the appropriate region of the 
silica gel after TLC or the organic phase after 
differential extraction. 
20 (b) Lucif erase . The DNA sequence coding firefly 
f Photinus pvralis ) luciferase can be used in a very 
versatile reporter gene system (2,3). Luciferase catalyses 
the reaction: 

Luciferin + ATP + 0 2 -* oxyluciferin + AMP + PPi + C0 2 + light 
25 Under appropriate conditions, both the maximum light 

intensity and the total light output ov r a period of time 
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are proportional to the concentration of luciferase in the 
reaction system. The light emission can be measured by a 
luminometer or by liquid scintillation. 

< c > Pe*a-t?alactosjdase . 0-galactosidase from E. coli 
5 is a tetramer with a subunit size of 1023 amino acids. It 
catalyses the hydrolysis of various /9-galactosides, 
including lactose. Measurement of /3-galactosidase activity 
provides a useful internal control for measuring the 
variability in reporter enzyme activity caused by 
10 differences in transfection efficiency or cell extract 
preparation (4). A simple photometric assay method may be 
used to measure hydrolysis of the substrate ONPG (o- 
nitrophenyl 0D-galactopyranoside) by 0-galactosidase in a 
cell-free system. The recent discovery of substrates for 
15 0-galactosidase which generate fluorescent products has 
significantly increased the sensitivity of this assay. 

Nevertheless, there are serious disadvantages in 
all the known reporter gene systems: 

(1) Although the luciferase system is very sensitive, 
20 the luminometer required to measure the enzyme activity is 

expensive and not very widely available; 

(2) mRNA from the reporter gene and consequently 
enzymatic activity, can be generated by read-through 
transcription or by spurious initiation of the 

25 transcription from within the plasmid-associated 

procaryotic sequences, it has been demonstrated that many 
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plasmid c nstructions containing CAT as reporter gene may 
drive transcription initiation at several unspecific sites, 
leading to spuriously increased CAT activity. In addition, 
the assay method is quite complicated and not easy to 
5 reproduce. Use of radio labelled compounds requires 
special laboratory licences. 

(3) Many cells or tissues (e.g. epithelial cells) 
endogenously produce 0-galactosidase. Precise measurement 
of exogenous 0-galactosidase activity can therefore be 

10 difficult. 

The present invention provides a novel class of 
reporter genes, the use of which avoids at least some of 
the difficulties already mentioned. A recombinant protein 
or peptide expression system in accordance with the present 

15 invention comprises, as a reporter gene, a gene coding for 
an enzyme or other protein which is stable under 
conditions, e.g. of temperature, in which at least some, 
and preferably all, of the other protein or peptide 
products of the said system are denatured or otherwise 

20 inactivated. The expression system includes, in addition 
to the reporter gene, a promoter and/ or other DNA sequences 
which do not encode a protein or peptide product but assist 
in the expression of the reporter gene. The use of genes 
coding for highly thermostable enzymes represents a novel 

25 and advantageous strategy for the study of the efficiency 
of expression of eucaryotic cis-acting genomic sequences. 
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Using reporter genes which give rise to products which are 
intrinsically stable to temperature and protein denaturants 
allows the quick and efficient removal, by suitable 
selective denaturation, of any interfering proteins which 
5 are endogenously expressed by the host cells. 

Use of reporter genes encoding for enzymes or other 
proteins which are stable under extreme conditions offers 
the following advantages: 

(a) The new systems may be used in cell systems which 
10 endogenously express high levels of an interfering enzyme 

(e.g. the endogenous /9-gal activity of epithelial cells) 
which is not stable under extreme conditions. 

(b) since background enzymatic activity can be removed, 
the accuracy and sensitivity of the assay for the product 

15 of the reporter gene is significantly increased. 

(c) The detection system can be made to have high 
specificity which can be as high as that of a direct 
measure of the mRNA corresponding to the reporter gene. 

Many genes encoding thermostable enzymes are 
20 already known, especially those present in extremophile 
achae-bacteria. The thermostable enzymes have been well 
characterized (5,6,7,8) and the corresponding genes coding 
for them are available (9). The gene (lacS) encoding a 0- 
galactosidase (SsB-Gal) from the archeabacterium Sulfolohn* 
25 solfataricus is preferably used in the present invention. 
(This microorganism, strain MT4 , is available from, for 
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example , ATCC and DSM) . Th enzyme produced by this gene 
has been purified to homogeneity and extensively 
characterized (8) . A gene library of the microorganism has 
been constructed in lambda gtll and the 0-galactosidase 
5 gene isolated, cloned (9) and expressed in E. coli (10). 
The potency of lacS as a reporter gene in 
eukaryotic systems has been demonstrated in cultured 
mammalian cells. For this purpose, four different 
eukaryotic expression vectors, suitable for transient or 

10 stable trans fection have been constructed. All four 

vectors contain strong enhancer/promoter sequences which 
allow high expression of the cloned gene. Figures 1 and 2 
are plasmid maps of the four vectors which have been made. 
The first two plasmids suitable for transient transfection 

15 (see Figure 1) were designed as follows. 
pSV 3 ~lacS 

This plasmid was constructed by substituting the 
Hind III/Pvu II fragment of the CAT gene in pSV 2 -CAT(l) 
using a Hind III fragment containing all the coding 
20 sequence of the lacS gene. 
pRSV-lacS 

The Bam HI region (containing the lacS gene and the 
SV40 polyadenylation site and intron) from pSV 2 -lacS was 
inserted in the Hind III/Bam HI digested pRSV-neo(l) , 
25 replacing the neo coding sequence by the sequence 
containing the lacS gene. 
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Two more plasmids w re constructed for the 
selection of stable transf ectants . These are shown in 
Figure 2. Both the plasmids possess the neo gene under the 
control of the SV40 enhancer/promoter system. pRC/RSV-lacS 
5 and pRC/CMV-lacS were constructed by inserting the Hind III 
fragment of Lacs at the Hind III site of the polylinkers 
of, respectively, pRC/Rsy and pRC/CMV vectors obtained from 
INVITROGEN. 

Simian CV1 and mouse fibroblast NIH3T3 cell lines 
10 were used as host systems. They were transf ected by the 
calcium/phosphate coprecipitation technique. For transient 
invention, cells were harvested 48 hours after DNA addition 
and cell lysates were prepared and assayed for enzymatic 
activity. 

15 For stable transfection CVl cells transfected with 

pRC/CMV-lacS were selected by growth in the presence of 
selection marker G418. To assay the expressed Ss 0-gal 
activity, cell extracts were preincubated at 70-c for ten 
minutes in the presence of o.l% SDS to inactivate any 
20 endogenous enzymatic activity of the transfected cells. 
The assay was then performed by the method of Maniatis et 
fil (11) but at 75'C rather than 37»c. 

The results of the transient transfection 
experiments and the expression of the thermostable enzymes 
25 in the two cell lines are shown in Figure 3. Non- 

transfected NIH3T3 and CVl cells were used as negative 
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controls giving background activity valu s. Ss £-gal 
activity was reported in relation to cell number and 
normalised with respect to CAT activity determined in cells 
which were cotransfected with the pSV 2 CAT plasmid. NIH3T3 
5 cells were transfected with 2/xg of pSV 2 -CAT or 

alternatively with 2 or 20 fig of pRSV-lacS or pRC/RSV-lacS • 
CVl cells were cotransfected with the same amount of the 
reference plasmid or with 2 or 20/xg of pSV2~lacS or 
pRC/CMV. 

10 The expression system of the present invention is 

conveniently provided in the form of a plasmid or other 
vector comprising the said system, preferably with a 
polylinker sequence upstream of the reporter gene to 
facilitate cloning of a desired promoter region to be 

15 tested. As described above such plasmids may be used in 
transient or stable transfection. For control purposes, 
plasmids or other vectors may be provided having an 
expression system in which the same reporter gene is under 
the control of a promoter sequence of known potency. 
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CIAIMS 

1. A recombinant protein or peptide expression 
system comprising, as reporter gene, a gene coding for an 
enzyme or other protein which is stable under conditions in 
which at least some of the other protein or peptide 
products of the said system are inactivated. 

2. An expression system according to claim l in 
which the said gene codes for an enzyme which is stable 
under conditions of temperature in which substantially all 
of the other protein or peptide products of the said system 
are denatured. 

3. An expression system according to claim 2 in 
which the said gene codes for a thermostable /?- 
galactosidase of Bulfoiaw P oif a t a r< mw , 

4. An expression system according to claim 3 in 
which the said gene is the lacS gene. 

5. An expression system according to any one of 
claims 1 to 4 which also includes a promoter and/or other 
DNA sequence which does not encode a protein or peptide 
product but assists in the expression of the reporter gene. 

6. A plasmid or other vector comprising an 
expression system as claimed in any of claims l to 5. 

7. A plasmid or other vector according to claim 6 
having a polylinXer sequence upstream of the reporter gene. 
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